Czech language database of car speech and environmental noise
نویسندگان
چکیده
This paper will present new Czech language twochannel (stereo) speech database recorded in car environment. The created database was designed for experiments with speech enhancement for communication purposes and for the study and the design of a robust speech recognition systems. It respects car noise environment which is currently at the top of the interest. Tools for automated phoneme labelling based on Baum-Welch re-estimation were designed. The noise analysis of the car background environment was done.
منابع مشابه
Comparison of Three Czech Speech Databases from the Standpoint of Lombard Effect Appearance
This paper focuses on three Czech speech databases recorded in actual and simulated noisy conditions and explores their suitability for LE analysis and modeling. Parameters of Czech SPEECON, CZKCC car database and newly established Czech Lombard Speech Database (CLSD) are compared. All three databases comprise speech recorded in neutral conditions and speech uttered in noise of the moving car. ...
متن کاملSpeech Recognition in the Automobile
Acknowledgments Chapter 1: Introduction Chapter 2: The SPHINX Speech Recognition System 1 2 3 5 2.1 Signal Processing ............................ 5 2.2 Clustering and Vector Quantization ..................... 6 2.3 Hidden Markov Models .......................... 7 2.4 Speech Unit ............................... 7 Chapter 3: The Motorola Car Database and AN4 Database 8 3.1 The Motorola Car Data...
متن کاملDesign and collection of Czech Lombard speech database
In this paper, design, collection and parameters of newly proposed Czech Lombard Speech Database (CLSD) are presented. The database focuses on analysis and modeling of Lombard effect to achieve robust speech recognition improvement. The CLSD consists of neutral speech and speech produced in various types of simulated noisy background. In comparison to available databases dealing with Lombard ef...
متن کاملReduced complexity equalization of lombard effect for speech recognition in noisy adverse environments
In real-world adverse environments, speech signal corruption by background noise, microphone channel variations, and speech production adjustments introduced by speakers in an effort to communicate efficiently over noise (Lombard effect) severely impact automatic speech recognition (ASR) performance. Recently, a set of unsupervised techniques reducing ASR sensitivity to these sources of distort...
متن کاملSPEECHDAT-CAR. A Large Speech Database for Automotive Environments
The aims of the SpeechDat-Car project are to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. As a result, a total of ten (10) equivalent and similar resources will be created. The 10 languages are Danish, each language 600 sessions will be recorded (from at least 300 speakers) in seven characteristic envir...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999